Self Avoiding Paths Routing Algorithm in Scale-Free Networks 
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In this paper, we present a new routing algorithm called "the Self Avoiding Paths Routing Algo- 
rithm" . Its application to traffic flow in sclae-free networks shows a great improvement over the so 
called "efficient routing" protocol while at the same time maintaining a relatively low average packet 
travel time. It has the advantage of minimizing path overlapping throughout the network in a self 
consistent manner with a relatively small number of iterations by maintaining an equilibrated path 
distribution especially among the hubs. This results in a significant shifting of the critical packet 
generation rate over which traffic congestion occurs, thus permitting the network to sustain more 
information packets in the free flow state. The performance of the algorithm is discussed both on a 
Barabasi- Albert (BA) network and real autonomous system (AS) network data. 



The understanding of information flow 
throughout a complex network is an impor- 
tant issue from both a theoretical and a prac- 
tical point of view. The goal of such un- 
derstanding is to control the traffic in com- 
plex systems such as the internet. Indeed, 
the main problem with these systems is the 
emergence of congestion when the traffic load 
becomes higher than a certain threshold. To 
solve this problem, studies focus on the mech- 
anism of routing of information packets to 
reach their destinations. In this respect, 
the well known shortest path protocol directs 
the traffic more likely towards the more con- 
nected nodes (hubs). Many alternative rout- 
ing protocols (static and traffic aware proto- 
cols) have been proposed in order to obtain an 
equilibrated traffic load between the different 
paths. The latter protocols need extra com- 
munication between the routers compared to 
the static ones which can easily be imple- 
mented. But the proposed static algorithms, 
did not, in general, use the optimal paths ex- 
plicitly. By introducing a self avoiding mech- 
anism between the paths during the process 
of their construction, we were able to devise a 
new routing protocol called: "the Self Avoid- 
ing Paths Routing Algorithm" which results 
in an optimal distribution of the paths and 
performs better than many of the previous 
protocols in raising the overall network ca- 
pacity. 
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I. INTRODUCTION 

The routing problem represents one of the central ques- 
tions in today's internet traffic engineering [l]-_5|. Differ- 
ent routing protocols lead to different outcomes concern- 
ing the traffic flow throughout the network. In this re- 
spect, we can distinguish three classes of routing schemes 
[Q-Q. The first class uses a fixed routing strategy where 
the paths depend only on the physical structure of the 
network and are stored in a fixed routing table of each 
router. The second class consists of dynamic routing pro- 
tocols where the decision to send a packet through a given 
link depends only on the traffic state. Finally, in the 
third class, both the physical structure and the dynamic 
traffic information are used in order to take routing deci- 
sions [lc3 - [T2T ] . First methods present many advantages in 
terms of economical and technical costs compared with 
traffic aware methods and are widely used in mid and 
small systems [l3j . 

The traffic flow in communication networks is charac- 
terized by a phase transition from the free flow state, 
where the packet generation rate is balanced by the 
packet delivery rate, to the congested state where pack- 
ets accumulate rapidly in the network. This happens 
when the traffic generation rate R becomes greater than 
a threshold R c . The principal task of traffic engineering 
is to make R c as high as possible in order that the net- 
work could sustain much traffic load without congestion. 

In this respect, the shortest path routing (SPR) used 
for forwarding information packets [3, is not sufficient 
for this goal. Indeed, the structure of complex networks 
[l6j such as the internet 0. [l8[ is best described by an 
underlying scale-free 19-24] complex structure charac- 
terized by the existence of hubs (highly connected nodes) 
as well as peripheral nodes. Routing based on shortest 
paths often tends to send more information packets to- 
wards the hubs which rapidly become congested resulting 
in general congestion of the whole network. 

Other routing schemes have been proposed to over- 
come this difficulty. The so called efficient path routing 
(EPR) (ref.0) was used instead of SPR, with the cost of 
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a path is chosen to depend on the degrees of its nodes: 
Si (where fcj is the degree of node i and /3 is a control 
parameter). The best results are obtained with this al- 
gorithm for j3 = 1, though at the expense of the average 
packet travel time. On the other hand, in the hub avoid- 
ance (HA) routing[25j], the authors remove, in a first step, 
some hubs from the network which becomes a set of dis- 
connected clusters, and then build usual shortest paths 
between every pair of nodes in the same cluster before 
placing the hubs back and computing the paths between 
the remaining pairs of nodes. The results are found to be 
better than those obtained when using the shortest path 
(SPR) protocol. 

Although theses algorithms succeed in raising the crit- 
ical packet generation rate, they do not explicitly take 
into account the true optimal paths that should be used 
for routing. Another algorithm called " Optimal rout- 
ing" rQR)[26l [27| was proposed which tries to reduce the 
maximum node betweenness in a self-consistent manner 
reaching a near-optimal configuration of paths. Good 
results are obtained by this algorithm despite its slow 
convergence. 

In this paper, we propose a new algorithm which min- 
imizes self-consistently, the number of intersection nodes 
between the resulting paths. The results of this "Self 
Avoiding Paths Routing" (SAPR) algorithm show im- 
provements in comparison to the EPR presented in Ref. 
while at the same time reaching near-optimal path con- 
figuration in a relatively small number of iterations com- 
pared to the OR (Refs. |26| and 1271 ). In fact, the paths 
generated by our algorithm are not " self avoiding" in the 
strict sens; the SAPR algorithm tries to find self avoiding 
paths in an optimal sens, that is, paths that are as much 
as possible "self avoiding" or with minimal intersection 
points. 



II. MODEL 

The principal idea behind the SAPR algorithm is to 
execute iteratively the standard Dijkstra's algorithm for 
finding shortest paths between any given pair of source 
and destination nodes with the following main modifi- 
cation: instead of using the cost associated with each 
path as a fixed input for a Dijkstra's cycle, this cost is 
updated even during a cycle whenever a new path is dis- 
covered. This adaptive process permits us to take into 
account the previous situation in order to decide how to 
constract subsequent paths, by trying to avoid the exist- 
ing ones for the remaining path finding process during 
the same cycle. The output costs of one iteration are 
used as an input for the following one. 

A judicious choice for the cost associated with a path 
Vij from node i to node j can be chosen according to the 
following form: 

w(Vij) = w ^ ft 

uev tj 



where w(u) represents the cost associated with node u 
given by 

w(u) = [M p {u)] a (2) 

where Af p (u) is the number of paths found by the algo- 
rithm in the current step and traversing node u. a is 
an adjustable parameter. For a — 0, W{Vij) is just the 
number of nodes in the path Vij, and so we recover the 
shortest path algorithm. M p (u) is also calles the node 
betweenness (see ref. H^). 

The SAPR algorithm evolves just like the Dijkstra's 
one. We will not give the details of this algorithm, we 
will only show how to compute the costs associated with 
each node during the path construction process. In the 
following, we will call a "shortest path" a path that will 
have the lowest cost. 

Network nodes are visited in the same order used in 
the standard Dijkstra's algorithm. In order to explore 
all shortest paths starting from a given source node s to 
all the remaining N — 1 nodes, the algorithm starts at 
node s and records the "distance" d(v) from s to a any 
node v. This distance is given by the total cost of the 
nodes which constitute the path V sv : 

d(v) = J2 w ( u ) ( 3 ) 

Whenever a path with the lowest cost is discovered from 
s to a node v through an immediately preceeding node u, 
we need to store u as the antecedent of v: A s (v) — u. For 
the sake of simplicity, we will not cover the case where 
more than one shortest path is stored in the routing table, 
but will consider a unique path between any given pair 
of nodes. The generalization is straightforward. 

Suppose that the algorithm is now on the current node 
u, and that a neighbor v of u is checked (see Fig.[T]). The 
node v may have already a previously discovered lower 
cost path from source s: Vg V , through another node t 
with a corresponding distance: d t (y) = J^jev* W U)- S° 
we are faced with the three following situations: 

Case 1 : if d t (v) < d(u) + w(v), then the path through 
t still has the lowest cost, so nothing is done concerning 
node v in this step. 

Case 2: if d t (v) > d(u) + w(v), then the the distance 
to v from u is discovered as the new lower cost path. In 
this case, we need to update the number of paths passing 
through all nodes of the path V" v and the corresponding- 
costs by traversing the path backward from node v down 
to source node s and adding 1 to the number of paths 
of every node belonging to the path as is shown by the 
following pseudo-code: 

r 4— u 

while r ^ s do 

A/p(r) <- M p {r) + 1 
w(r) <- [N p (r)] a 
r «- A s (r) 
end while 
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In the other hand, all nodes of the previous path V\ v 
should be updated according to: 
r <- t 

while r ^ s do 

Afp(r) <r- Af p (r) - 1 
w(r) <- [A/" p (r)]« 
r <- A s (r) 
end while 

meaning that a path traversing the corresponding nodes 
is no longer used. So, before the updating, the path 
from node s to node v via node t has the lowest cost: 
Vsv = V\ v . After being updated, the path through node 
u is discovered as the new lower cost one: V sv = T™ v . 

Case 3: if d t (y) = d(u) + w(v), then the two paths 
through u and through t are of equal cost, so we choose 
either case 1 or case 2 randomly (with probability 1/2) 
in the case where we want to store in the routing table a 
unique shortest path between any pair of nodes. But in 
the contrary, the path through V\ v has to be stored as a 
shortest path as well. 




(a) (b) 

FIG. 1. Path discovery process of the SAPR algorithm, (a): 
The algorithm is in node u and tries to check for node v 
with an already existing path via t. (b): In this case, the 
path through u is found to have the lowest cost and the path 
through t is removed, so the node costs are updated for the 
two paths down to the source node s. The operations +1 and 
— 1 are to be performed on the number of paths on every node 
along the two paths (see the two pseudo-codes in text). 

The algorithm proceeds according to the following 
steps: 

1. Assign 1 to the cost of every node. 

2. Compute the shortest paths from a given source 
node to every destination node; and for every path 
found, update the number of paths for the corre- 
sponding nodes and compute the costs following 
the two pseudo-codes above. 

3. Repeat step 2 without initializing the number of 
paths nor the costs associated with every node but 
with using them as input fot the next cycle. 



III. SIMULATION RESULTS AND DISCUSSION 

For most of our numerical simulations, we used a 
Barabasi- Albert (BA) network (Ref. OJ) with TV = 1000 
nodes (except for fig. [3] where we used values up to 
N = 2000 and fig. [TO] where N = 200). This network 
is built starting with too = 3 fully connected nodes; and 
at each time step, a new node with to = 2 edges is added 
to the existing nodes with preferential attachment, that 
is with probability pi that depends on the node degree fcj 
of every candidate node: 



The network generated has an average node degree: 
(k) = 2m = 4. Furthermore, it has a power-law de- 
gree distribution characterized by the existence of highly 
connected nodes (hubs) and peripheral nodes. 

In addition, in order to test the proposed algorithm on 
different types of complex networks, we have done traffic 
simulations on a real autonomous system (AS) network 
data which consist of periodic snapshots of BGP routing 
table dumps [281 ] . For our purpose, we used a network of 
size N = 549 from the AS-733 dataset. 

Using the BA network with the SAPR algorithm, we 
first check for the number of necessary iterations to ob- 
tain the convergence of the algorithm. The average path 
length is calculated and its variation is reported in Fig. [5] 
in function of the number of iterations for a — 4.0, 6.0 
and 10.0 for the BA network. It can be seen that the al- 
gorithm quickly converges to a near-optimal value within 
just a few iterations. The simulations show that for small 
values of a, the convergence is very fast; for example, 
when < a < 3, a number of 15 iterations suffices while 
for 3 < a < 10, we need up to 50 iterations. How- 
ever, when a becomes larger (strong interacting paths), 
we need much more iterations, but in return, there is no 
substantial gain in terms of the performance as we will 
show later. 

In Fig. [31 we have plotted the average path length (L) 
as a function of the number of nodes in the BA network: 
N for a = 2.0,4.0 and 6.0. We can see that, even if (L) 
is a monotonically increasing function of N, the small 
world property [2 9j is maintained for different values of 
a, that is: 

(£)~logiV. (5) 

We used the SAPR algorithm to simulate traffic flow 
in the BA network where each node is treated as a host 
and a router, as follows: For each time step, we generate 
R packets from random sources and assign them ran- 
dom destinations, then put each of them at the end of a 
queue on the corresponding source. At each time step, 
every node can deliver at most C packets from the top of 
its queue towards their destinations (Here we set (7=1 
without loss of generality). The packets are navigated 
one step forward to their destinations by using the fixed 
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FIG. 2. Number of current packets in the network N cur as 
a function of the number of iterations Niter for the SAPR 
algorithm for different values of the parameter a (BA network 
with TV = 1000). 
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FIG. 3. Plot of the average path length (L) as a function of 
the number TV of nodes in the BA network for a — 2.0, 4.0 
and 6.0. 



routing table maintained at each router. If a packet's 
destination is found among the neighbors of the current 
node, it is directly sent to it; else, it is sent to the neigh- 
boring node with the minimal distance to the destination 
of the packet and is put at the bottom of its queue. Pack- 
ets reaching their destinations are automatically removed 
from the system. 

To characterize the traffic dynamics, we used the order 
parameter r\ given bv(30|: 
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At) - N p (t)} 
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(6) 



where N p (t) is the number of packets in the network 
at time t and (...) is the average over time windows of 



FIG. 4. Order parameter rj (given by Eq. [6j of the SAPR 
algorithm as a function of the packet generation rate R for 
different a values. The arrow indicates the position of the 
critical value R c ~ 67 for a — 10.0 (BA network with TV = 
1000). 



width At. In Fig. 21 we report the order parameter r\ 
versus R 7 the number of packets generated per unit time 
for a = 1.0,2.0,4.0,6.0 and 10.0. We can distinguish 
two phases, for R < R Cl where R c is a critical value, 
the system is in a free flow state where the number of 
generated packets is balanced by the number of packets 
delivered to their destinations and rj = 0. While for 
R > R c , a congested phase takes place on the network 
and (N p (t + At) — N p (t)) grows linearly with At and thus 
rj tends towards a constant value for fixed C and R values. 
It is clear that R c increases with a and we can conclude 
that the SAPR algorithm is very efficient for larger values 
of a. This result is consistent with the fact that when a is 
large, the paths tend to be as much separated as possible. 
So when the information packets use these paths, they, 
more likely, try to generate an equally distributed load 
among the different routers (regardless if they are hubs 
or peripheral nodes) resulting in more traffic load in the 
free flow state. 

In order to compare the performance of the SAPR al- 
gorithm (for a — 10.0) with the shortest path routing 
(SPR) and the efficient path routing (EPR)(Ref. 0) al- 
gorithms, we have plotted in FigfSl the variation of the 
average packet travel time (T) in function of R for the 
three algorithms which was found to have the same be- 
haviour as the order parametet rj and the same transition 
point at R c . We can see clearly the large value of R c for 
the SAPR algorithm compared with the others which is 
an indication that it is much more performant than SPR 
and EPR protocols. 

The average travel time is also plotted in Fig |6] for the 
algorithm: SPR, EPR and SAPR (with a = 5.0) for the 
AS-733 network. The results clearly confirms the fact 
that the SAPR algorithm shows its superiority not only 
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FIG. 5. Plot of the average travel time (T) as a function 
of the packet generation rate R for the algorithms SPR and 
EPR compared with the SAPR algorithm with a = 10.0 (BA 
network with N = 1000). 



FIG. 7. Variation of the average path length (L) vs. a. The 
values of (L) corresponding to the algorithms SPR and EPR 
are indicated for comparison by the arrows (BA network with 
N = 1000). 
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FIG. 6. Plot of the average travel time (T) as a function of 
the packet generation rate R for the algorithms SPR and EPR 
compared with the SAPR algorithm with a = 5.0, for the AS- 
733 Oregon Route Views dataset network with N = 549. 



for the theoretical BA networks, but also on real network 
data. 

Moreover, we found that the average travel time (T) is 
relatively lower for the SAPR algorithm even in the free 
flow phase for some range of a values. It is straight- 
forward to see that the average travel time is equal 
to the average path length (L) for low values of R: 
(L) = lirnR-^Ov^ 1 ) m the case where the sources and desti- 
nations of packets are both chosen at random. In Fig. [71 
we have reported (L) as a function of a. We can see 
that (L) increases to a limiting value corresponding to 
a — > oo. This is due to the fact that when a = 0, the 
SAPR protocol is equivalent to the SPR one and the 



FIG. 8. Critical packet generation rate R c as a function of a 
values. The arrows indicate the values corresponding to the 
SPR and the EPR algorithms (BA network with N = 1000). 



average path length has its absolute minimal value be- 
cause of the abscence of any "interaction" between the 
paths that are free of any constraint during their con- 
struction process. When a increases, however, the mu- 
tual avoidance between the paths becomes more impor- 
tant and they are more and more constrained to look 
for less occupied nodes and thus become longer and try 
to choose their optimal configuration where the average 
path length reaches its maximum. 

Although the average path length becomes somewhat 
longer, this is compensated by a larger value of the crit- 
ical packet generation rate R c (see Fig. [8]) which has the 
same behaviour as (L) with a low starting value for a = 
corresponding to the SPR algorithm: R^ PR w 4 to the 
value Rf APR w 67 for a = 10 which is larger even than 
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FIG. 9. Dependence of the number of paths N pa ths traversing each node on the node degree k for the three algorithms: SPR, 
EPR, SAPR (with a = 8.0) and EPR2 (BA network with N = 1000). 



the EPR algorithm: R^ PR » 37. For further greater 
values of a, the increase of R c is very weak, and the 
gain in performance for the traffic flow is not important 
compared to the large number of iterations necessary for 
convergence. 

Now we will try to explain the performance of the 
SAPR algorithm compared especially to the EPR one. To 
this end, we plotted in Fig.[9l the number of paths N pat hs 
traversing each node (or its betweenness) as a function 
of the node degree k. As can be expected, the SPR pro- 
tocol results in a large number of paths traversing the 
hubs compared to the peripheral nodes. In contrast, the 
EPR protocol succeeds in reducing the number of paths 
traversing the hubs while nodes with intermediate degree 
receive more paths. But we believe that this algorithm 
reduces the number of paths crossing the hubs more than 
necessary. Indeed, as the figure shows, the SAPR algo- 
rithm establishes a more equilibrated distribution of the 
paths among the hubs. 

To further clarify our argument, we have modified the 
cost in the EPR algorithm (w(i) — ki see Ref. 0) by 
reducing the cost of the hubs (having ki > k c ) with k c = 



15 for example, at the expense of the other nodes, by 
using the simple formula w(i) = ki for ki < k c and w(i) = 
kl.kl~ e otherwise (e is a parameter in the range [0, 1] 
and the factor k e c guarantees continuity). For e = 0.3, 
this algorithm (called EPR2) performs better than the 
standard EPR reaching R c rs 43 instead of 37 for the 
same BA network. This result was achieved by the fact 
that the paths distribution is more equilibrated than that 
of the EPR protocol (see Fig. [9j) , though not optimal like 
the SAPR protocol. 

In another respect, although the SAPR algorithm 
presents some similarities with the OR one, in the fact 
that they are both heuristic algorithms that try itera- 
tively to find a near optimal solution for the routing prob- 
lem by acting on the node betweenness, we will discuss 
here some differences between them. While the SAPR 
algorithm tries to redistribute paths across the whole 
network by acting on the node betweenness of all the 
network nodes in every move during every iteration of 
the algorithm, the OR protocol, in the other hand, tries 
to reshape the betweenness landscape by minimizing the 
maximum betweenness B max by updating it at the end 
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of each iteration. 

From a computational point of view, the OR protocol 
runs in 0(N 3 \ogN) (see RefsH3)). In the other hand, 
the SAPR is based on the Dijkstra's algorithm which runs 
in 0(M + NlogN) (if implemented with the Fibonacci 
heap) , in order to find all the shortest paths starting from 
a given source node, where N and M are, respectively, 
the number of nodes and the number of links in the net- 
work. The SAPR protocol uses the supplementing step 
of traversing the paths backwards to update their costs, 
so the time becomes 0{M(L) + N log N) ((L) is the av- 
erage path length) . Taking into account the fact that the 
paths are constructed for the N sites and that the simu- 
lation takes O(N) iterations, we conclude that the SAPR 
algorithm takes 0(N 2 (M{L)+N log N)) execution time. 

For networks that obey the small world property 
(Eq. [5|) , such as the B A network for example and the fact 
that M is proportional to N in this type of networks, we 
are left with the final running time of the SAPR algo- 
rithm: 0(N 3 log AT) in the worst case. Then, we can say 
that SAPR and the OR algorithms (see ref. 1271 ) perform 
in relatively the same computational time in the large 
networks limit. 

For a direct quantitative comparision of the two algo- 
rithms, we have plotted in Fig. [TU1 the variation of the 
average travel time for the SPR, the OR and the SAPR 
protocols for the BA network with N = 200 nodes. The 
results for the critical packet generation rates are respec- 
tively R S C PR ~ 5, R° R ~ 18 and R s c APR ~ 20, showing 
some advantage of the SAPR protocol. 

In the other hand, although the two algorithms have 
the same execution time in the large network limit, we 
have experienced a very slow convergence using the OR 
protocol wich needs up to 2000 iterations to obtain a 
satisfactory convergence for the maximum betweenness 
(for the BA network with 200 nodes). While to obtain 
convergence of the SAPR protocol, we need only ~ 15 
iterations (see for example Fig. [2]). One possible expla- 
nation to this difference is the fact that the OR is a form 
of extremal optimization which tries to reduce the quan- 
tity B max on a single node in every step, whereas the 
SAPR takes advantage of the path interaction on each 
node during every single iteration and uses this collec- 
tive information for optimization, thus reaching quickly 
an equilibrated path distribution landscape accross the 
network. 

We note here that we have done extensive traffic sim- 
ulation for the goal of explicit comparison between the 
performance of the different algorithms in real traffic sit- 
uation. But if the mere goal is to determine the tran- 
sition point from free flow to congested state, it can be 



achieved directly from the maximal betweenness B max 
through the relation (see ref I2BI ): 7 C = (N — l)/B max , 
where 7 C is the critical probability of packet insertion rate 
at a given node. 
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FIG. 10. Plot of the average travel time (T) as a function of 
the packet generation rate R for the algorithms SPR and OR 
compared with the SAPR algorithm with a = 6.0 for the BA 
network with N = 200 nodes. 



IV. CONCLUSION 

In summary, we have proposed a new routing algo- 
rithm called the Self Avoiding Path Routing (SAPR) al- 
gorithm where the paths are self-avoiding in the sens that 
the inetersection between them is reduced at its mini- 
mum. It uses the number of paths crossing each node of a 
given path as the cost associated with that path. Theses 
costs are found self-consistently by updating them during 
each step of the path discovery process of the original Di- 
jksta's algorithm. This fact results in a more equilibrated 
path distribution especially among the hubs. 

The direct result was the increase of the critical packet 
generation rate above which a jamming state occurs re- 
sulting in a much greater traffic load that can be sus- 
tained by the network without beeing congested. More- 
over, the average path length was found to be relatively 
small and obeys the small world property. 

The algorithm performs better than both the shortest 
path routing "SPR" and the efficient path routing "EPR" 
protocols. The former was shown to have the majority 
of paths concentrated on the hubs while the latter was 
shown to avoid hubs more than necessary. It was shown 
also that the SAPR algorithm takes the same compu- 
tational time as the OR protocol but this latter have a 
slower convergence. 
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